A Forward Scan based Plane Sweep Algorithm for Parallel Interval Joins

Authors

  • Panagiotis Bouros
  • Nikos Mamoulis
Abstract

The interval join is a basic operation that finds application in temporal, spatial, and uncertain databases. Although a number of centralized and distributed algorithms have been proposed for the efficient evaluation of interval joins, classic plane sweep approaches have not been explored to their full potential. A recent piece of related work proposes an optimized approach based on plane sweep for modern hardware, showing that it greatly outperforms previous work. However, this approach depends on the development of a complex data structure, and its parallelization has not been adequately studied. In this paper, we explore the applicability of a largely ignored forward scan (FS) based plane sweep algorithm, which is extremely simple to implement. We propose two novel optimized versions of FS that greatly reduce its cost, making it competitive with the state-of-the-art single-threaded algorithm. In addition, we show the drawbacks of a previously proposed hash-based partitioning approach for parallel join processing and suggest a domain-based partitioning approach that does not produce duplicate results. Within our approach, we propose a novel breakdown of the partition join jobs into a small number of independent mini-join jobs with varying cost, and we manage to avoid redundant comparisons. Finally, we show how these mini-joins can be scheduled on a smaller number of CPU cores and propose an adaptive domain partitioning that aims at load balancing. We include an experimental study that demonstrates the efficiency of our optimized FS and the scalability of our parallelization framework.
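To make the forward scan idea concrete, the following is a minimal sketch of the basic, unoptimized FS plane sweep join. It is not the optimized variants or the parallel framework the abstract describes. It assumes closed integer intervals represented as `(start, end)` tuples; the name `forward_scan_join` and this representation are illustrative choices, not taken from the paper.

```python
from typing import List, Tuple

# Assumption: a closed interval (start, end) with start <= end.
Interval = Tuple[int, int]


def forward_scan_join(
    R: List[Interval], S: List[Interval]
) -> List[Tuple[Interval, Interval]]:
    """Return all pairs (r, s) with r in R, s in S whose intervals overlap."""
    # Both inputs are swept in order of their start endpoints.
    R = sorted(R, key=lambda iv: iv[0])
    S = sorted(S, key=lambda iv: iv[0])
    out: List[Tuple[Interval, Interval]] = []
    i = j = 0
    while i < len(R) and j < len(S):
        if R[i][0] <= S[j][0]:
            # R[i] starts first: scan forward in S while S intervals
            # start no later than the end of R[i].
            r = R[i]
            k = j
            while k < len(S) and S[k][0] <= r[1]:
                out.append((r, S[k]))
                k += 1
            i += 1
        else:
            # S[j] starts first: symmetric forward scan over R.
            s = S[j]
            k = i
            while k < len(R) and R[k][0] <= s[1]:
                out.append((R[k], s))
                k += 1
            j += 1
    return out


if __name__ == "__main__":
    print(forward_scan_join([(1, 5), (6, 9)], [(4, 7), (10, 12)]))
```

Each overlapping pair is reported exactly once, by whichever of the two intervals starts first, so the sketch needs no auxiliary structure beyond the two sorted inputs.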

Similar resources

Sweep Line Algorithm for Convex Hull Revisited

The convex hull of a set of given points is the intersection of all convex sets containing them. It is used as a primary structure in many other problems in computational geometry and in other areas such as image processing, model identification, geographical data systems, and triangular computation of a set of points. Computing the convex hull of a set of points is one of the most fundamental and imp...

Variations on Sweep Algorithms: efficient computation of extended viewsheds and class intervals

Two novel applications of the plane sweep paradigm are demonstrated, namely, for the computation of extended viewsheds on gridded DEMs and for class interval selection on TIN-based DEMs. In both cases, the efficiency of the plane sweep algorithm is significantly better than a straightforward approach. The algorithms are presented by first giving the plane sweep method as a general approach that requ...

Design and Kinematic Analysis of a 4-DOF Serial-Parallel Manipulator for a Driving Simulator

This paper presents the kinematic analysis and the development of a 4-degree-of-freedom serial-parallel mechanism for large commercial vehicle driving simulators. The degrees of freedom are selected according to the target maneuvers and the structure of human motion perception organs. Several kinematic properties of parallel part of the mechanism under study are investigated, including the inve...

Sweeping an oval to a vanishing point

Given a convex region in the plane, and a sweep-line as a tool, what is the best way to reduce the region to a single point by a sequence of sweeps? The problem of sweeping points by orthogonal sweeps was first studied in [2]. Here we consider the following slanted variant of sweeping recently introduced in [1]: In a single sweep, the sweep-line is placed at a start position somewhere in the plane,...

Efficient Self-Join Algorithm in Interval-based Temporal Data Models

The interval-based temporal data model is a popular data model in temporal databases. It uses time intervals to represent the period of validity of a tuple, leading to unavoidable self-joins when combining tuples for objects. It requires a (k + 1)-way self-join for k conjunctive conditions. Join operations are one of the most expensive operations in databases and they are even more serious in tempo...

Journal:
  • PVLDB

Volume 10

Publication date: 2017